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Abstract. In his 2005 paper, S.T. Smith proposed an intrinsic Cramer-Rao bound 
on the variance of estimators of a parameter defined on a Riemannian manifold. In 
the present technical note, we consider the special case where the parameter lives in a 
Lie group. In this case, by choosing ,e.g., the right invariant metric, parallel transport 
becomes very simple, which allows a more straightforward and natural derivation of the 
bound in terms of Lie bracket, albeit for a slightly different definition of the estimation 
error. For bi-invariant metrics, the Lie group exponential map we use to define the 
estimation error, and the Riemannian exponential map used by S.T. Smith coincide, 
and we prove in this case that both results are identical indeed. 


1 Introduction 

The Cramer-Rao bound is a lower bound on the achievable precision of any unbiased esti¬ 
mator of a vector 8 which parametrizes a family of probability distributions p(X\6), from a 
sample Xi, ■ ■ ■ , X n . This bound is standard in classical estimation theory. Differential geom¬ 
etry considerations in statistics can be traced back to equivariant estimation m (see also 
[8] and references therein for a more recent exposure) and of course to the work by Fisher 
on the Information metric, and all the works that followed, notably in information geometry. 
The paper m proposes to derive an intrinsic Cramer-Rao bound for the case where the 
parameter lives in a Riemannian manifold. The two examples given are subspace estimation 
(that pertains to the Grassman manifold) and covariance matrix estimation (that pertains 
to the cone of positive definite matrices), both examples being related to signal processing 
applications. See also the nice extensions proposed since then by N. Boumal [HE], and our 
gentle introduction to the subject [2] for more details. 

Our motivating example is the so-called Wahba’s problem m, named after Grace Wahba, 
which is an optimization problem where the parameter is a rotation matrix, but which can 
also be viewed as the search for a maximum likelihood rotation estimator. The application 
invoked in m is satellite attitude determination. The derivation of more sophisticated atti¬ 
tude estimators has been the subject of a lot of research over the last decade, mainly driven 
by the burst of mini UAVs (unmanned aerial vehicles), especially quadrotors. The reader is 
referred to, e.g., [4] for examples. 

In the present paper we propose a general derivation of an intrinsic Cramer-Rao lower 
bound on Lie groups, that is similar to the one proposed by S.T. Smith on manifolds, that is, 
we retain terms up to the second order in the estimation error. The discrepancy between the 
estimation and the true parameter is naturally defined in terms of group operation (which 
makes it intrinsic). Thus, the bound differs from the Euclidean one because of the non¬ 
commutativity of the group operation, yielding some additional terms that are expressed 
thanks to the Lie bracket (or alternatively structure constants). It is interesting to note our 
result coincides with the result of S.T. Smith in the case where the metric is bi-invariant, as 
in SO(3). However, both formula disagree in the general case, as the definition of estimation 
error in terms of group multiplication differs from the intrinsic estimation error based on the 
Riemannian exponential proposed by S.T. Smith. 


2 An intrinsic Cramer-Rao bound on Lie groups 


We compute here the Intrinsic Cramer-Rao Lower Bound (ICRLB) on a Lie group G, up to 
the second order terms in the estimation error log(gg -1 ), where g £ G is the true value of 
the parameter and g £ G the estimate. 

2.1 Preliminaries 

Let G be a Lie group of dimension n. To simplify notations we assume G is a matrix Lie 
group. The tangent space at the Identity element Id, denoted g. is called the Lie Algebra of 
G and can be identified as R", that is 


0 


The (group) exponential map 


exp :g G 

£ exp(£), 

provides a local diffeomorphism in a neighborhood of Id. The (group) logarithmic map 

log : G g, 

is defined as the principal inverse of exp. For any estimator g of a parameter g £ G, it allows to 
measure the mean quadratic estimation error projected onto the Lie algebra (the error being 
intrinsically defined in terms of group operation, where the group multiplication replaces the 
usual addition in R") 

(log^W 1 )) 

where E denotes the expectation assuming X is sampled from P(X\g). The logarithmic map 
allows also to define a covariance matrix of the estimation error: 

p = (logGra(X)- 1 ) log(sff(X)- 1 ) T ) e R raxn . (l) 


2.2 Main result 

Consider a family of densities parameterized by elements of G 

p(X\g), X £ R fc , g £ G. 

Using the exponential map, the intrinsic information matrix J{g) can be defined in a right- 
invariant basis as follows: for any £ £ R", 

£, T J(g)Z = j (^ t _^ogp(X | expit^g)^ logp {X | exp (tf) g)J p(X \ g)dX, (2) 

and then J(g) can be recovered using the standard polarization formulas 

J{g)v = \ ((£ + v) T J{g) (£ + v)~ ?J(g)Z - v T Juj . 

Besides, using the fact that Jp (X | exp (i£) g) dX is constant (equal to 1), which implies 
0 = ^ f p ( X I exp (*0 9 )dX = J ^ logp {X | exp (t£) g)^jp(X | exp (t£) g) dX, (3) 


we have, differentiating equality © a second time w.r.t t and reusing that = p4: logp: 

0 = / (^2 lo &P ( X I exp {tO g)j P(X\ 9 ) dX 

+ j (| t _ o lo gp(X|exp(^) 5) ) logp (X | exp (tf) g)j p(X\g)dX, 

allowing to recover an intrinsic version of the classical result according to which the informa¬ 
tion matrix can be also defined using a second order derivative 

t, T J{g)Z = (jL\ og p(X\ exp (if) ff)) • 

Let g be an unbiased estimator of g in the sense of the intrinsic (right invariant) error gg -1 , 
that is, 

J r log [gg P0 _1 ) P (x\g) dX = 0. 

Let P be the covariance matrix of the estimation error as defined in 0. Our main result of 
this section is as follows 


Ph ( Id + -XP.H ) J(g) _1 


Id + — P.H 


(4) 


where we have neglected terms of order E g ^ 11 log [gg (X) y ll 3 J i anc l where H is the (1,3)- 
structure tensor defined by 

H {X, Y, Z) := [X, \V,Z]], 

and where P.H is the tensor contraction of P and H on the two first lower indices of H , 
defined by ( P.H) ki = P^^ljk- Using the structure constants of G defined by 


[e», ej] : ^ c%e k , ( ad ei ){ = 0 %, (5) 

k 

note that the components of H can be expressed by the equality 

^ fc = E c ™4- ( 6 ) 

i 


The latter result is totally intrinsic, that is, it is independent of the choice of the metric in 
the Lie algebra g. 

For small errors, we can neglect the terms in P on the right hand side (curvature terms) 
yielding the approximation which reminds the Euclidean case 


P =J 

f log 

s 

log 

[gg{xr l ) 

T i 

p {X\g) dX h J (g) + curvature terms 


2.3 Proof of the result 

Let g be an unbiased estimator of g in the sense of the intrinsic (right invariant) error gg~ x , 
that is, 


IX 


log [gg {X)- 1 )p{X\g)dX = Q. 





If we let £ be any vector of the Lie algebra and tsR, the latter formula holds with g replaced 
by exp (t£) g and X sampled fromp(X| exp (f£) g). Thus we have E exp ( t j) s ^log exp (i£) gg (X) -1 
0 for any t £ R. Differentiating this equality written as an integral over X we get 

^ J log exp (t^ggiXy 1 p (X| exp (i£) g) dX = 0. 

Formally, this implies at t = 0 

J^D log| flS(JC) -i [^(AT) -1 ] P (-%) + log (sffPO -1 ) -Dp (AT | •) | ff [tg]JdX = 0. (7) 

For any linear form u(-) of the Lie algebra g we have thus: 


' J x u {° log| g§w -i £,gg(X) 1 ^jp(X\g)dX 

= j x u[\og(gg{X)~ 1 )Dp{X | -M^dX 




]J^u(\og(gg(X)-i)) 2 p(X\g)dX^ (J^D\ogp(X | OU&I) 2 ?™) dX, (8) 


where we used the Cauchy Schwarz inequality and the relationships 

Dp = pD log p, and then p = (y/p) 2 . 


We then introduce a basis of g and the vector A(X) = log (gg(X) 3 ). According to ([2]) the 
right-hand integral in (jHD is J(g), which yields (u being assimilated to a vector of g) : 



lx 


Dlog\ g g(x)-i[Zxgg{X) l ]p(X\g)dX 


< u 


A(X)A(X) T p(X\g)dX 


IJX 


« u r ^. 


(9) 


where we added the subscript x to £, distinguishing the element £ x of g and the column vector 
£ containing its coordinates in the chosen basis. Now we compute a second-order expansion 
(in the estimation error) of the left-hand term. To do so, we note from the Baker-Campbell- 
Hausdorff (BCH) formula retaining only terms proportional to t 


log [exp (f£) Q] = log [exp (t£) exp (log(Q))] 

= t £~\ [ lo g(<2)^] + [log(Q), [log(Q), t£] -F O (|| log(Q)|| 3 )] t£. 

This gives the formula below: the second-order expansion in log(Q) of the derivative of the 
left-hand term w.r.t to t. Note that this approximation will be integrated over Q in equation 
ED and therefore a rigorous reasoning should prove the density p is small where the higher- 
order terms become larger. Here we assume p is sufficiently peaked for this approximation to 
be valid. 

Dl og q[(0x< 9] = [/- ^odiog(Q) + ^adf og(Q) ]C (10) 

Moreover we have by linearity (£ is here deterministic): 

E 9 Q [l°g (gg(X)- 1 ) , t£] ^ i [E g (log (gg(X)- 1 )) ,t£\ = 0, 











and also 


Eg[z, [x,(]} = ^2H l ijk E g (x i x j )£ k ei = G°£, 

where G° is a matrix whose entries are functions of E g (xx T ) and the structure constants 
(see ([5])): G° is defined by G° = P.H , i.e. G k l = JW P 1 -' H\- k with H defined as in ©. We 
introduce the latter second-order expansion in the error in equation |9l) : 



__ tP 

l2 aa Mx) 


p(X\g)dX( 


< u 


A(X)A(X) T p(X\g)dX 


IJ X 


u £, T J(g)t 


(ii) 


Letting P = f x A(X)A(X) T p(X\g)dX we get: 


u- i/+^G u (p))e 


1 2 


< ( u T Pu) (ej(g)0 


Replacing £ with the variable £ = J(g) 1 (I + ^LG°(P)) u directly allows to prove that 


P^ 


1 + ^G U (P)1 J( 5 )-Ml + i-G 0 (P) 


12 


12 


( 12 ) 


where 

G°(P) = E 9 [log {gg- 1 ) , [log (ss" 1 ) , ■]] ■ (13) 

Remark 1 If the model is equivariant, i.e., verifies V/i € G,p (/iX = p(X\g) (see 

W), the study can be restricted to equivariant estimators (estimators verifying g{hX) = 
gi^X)^ 1 ). In this case equation (1131) simplifies: 


G°(P) = / [log ( gg(X )" 1 ), [log ( gg(Xy 1 ), •]] p(X\g)dX 
■) x 

= [ ^og (gigXy 1 ) , [log (g(gX)- 1 ) ,-]]p(gX\Id)dX 
J x 

= / [log ( g(X' r 1 ) , [log {g(X')- 1 ), •]] p{X'\Id)dX', 

JX' 

where the integration variable X has been replaced by X' = gX in the latter equality. Thus if 
the model is equivariant, the Cramer-Rao bound is constant over the Lie group. 


3 Links with the more general Riemannian manifolds case 

In the paper m, the author derives the following intrinsic Cramer-Rao bound (see Corollary 
2). Assume 6 lives in a Riemannian manifold and 6 is an unbiased estimator, i.e. E g (exp^ 1 0) = 
0 where exp is the geodesic exponential map at point 9 associated with the chosen metric. 
The proposed ICRLB writes (up to higher order terms) 

P := E 9 (exp^ 1 6) (exp^ 1 h J(^)” 1 - ^ (P m (P) J^)" 1 + J(0)" 1 P m (P)) , 

where for sufficiently small covariance norm the matrix P m (P) can be approximated by the 
quadratic form 


12 —> ( Rm{P)L2 , 12) = Eg (^R ^exp e 1 9,12^ 12, exp e 1 9 ^ , 










where R is the Riemannian curvature tensor at 8. 

The considered error log(gg~ 1 ) being right-invariant, we can assume g = Id to compare 
our result to the latter. We then see, that up to third order terms, formula m coincides 
indeed with the result of El if 

G°(P) = -4 R m (P), 

where 

(Rm(P)t £} = E s (R (log ( gg(X )" 1 ) , £) & log ( gg(X )" 1 )) , 

and <G°(P)£, 0 = ([log ( gg(Xy 1 ) , [log ( gg{Xy 1 ) , £]] , 0 • 

If the metric is bi-invariant, as is the case for G = SO(3), we recover the result of [IT) . 
Indeed, for bi-invariant metrics on Lie groups, the Riemannian curvature tensor satisfies for 
right-invariant vector fields (see ,e.g., PQ) 

R(X,Y)Z = ~[[X,Y],Z]. (14) 

The question of comparing both formulas boils down to proving for Z random vector s.t. 
E(Z) = 0 that E g (£, [Z, [Z,£]]) = — 4E g (R(Z 1 £)£, Z). It can be verified as follows: 

-4E g {R(Z, Z) = E g {[[Z, ZU],Z)= E g {[Z, [.Z , £]],■£> = E s <£, [Z, [Z, £]]> 

where the second equality stems from a property of the bi-invariant case (e.g., m) that 
generalizes the mixed product on SO( 3) property, namely (\X,Y],Z) = {[Z,X],Y). 

3.1 Differences 

If the metric is not bi-invariant the results are different. This is merely because then the Lie 
group exponential map and the Riemannian exponential map are not the same. Thus, our 
definition of the estimation error differs, so it is logical that the results be different. 

4 Conclusion 

In this paper, we have proposed an intrinsic lower bound for estimation of a parameter that 
lives in a Lie group. The main difference with the Euclidean case is that the estimation 
error between the estimate and the true parameter underlying the data is measured in terms 
of group multiplication (that is, the error is an element of the group). This is a much more 
natural way to measure estimation errors, and it has been used in countless works on statistical 
estimation and filtering on manifolds. But it comes at a price of additional terms in the bound, 
that are not easy to interpret. The bound is not closed-form, but by retaining only first and 
second order terms in the estimation error, we ended up with a closed form bound, that is the 
covariance of any unbiased estimator is lower bounded by the inverse of the intrinsic Fisher 
information matrix, as in the Euclidean case, plus additional terms that are functions of the 
covariance. Moreover, the nice structure of Lie groups allows straightforward calculations, 
and the bound is expressed with the help of the Lie bracket, this, in turn, being related to 
the sectional curvature at the identity, helping to draw a link with the general result of S. T. 
Smith on Riemannian manifolds m- 

This note generalizes previous calculations [3] obtained on SO (3), in the context of attitude 
filtering for a dynamical rigid body in space, the latter Cramer-Rao bound being compared 
to the covariance yielded by the intrinsic Kalman filter of 0]. It would be interesting to apply 
the obtained general bound to pose averaging (that is on SE(3)), as ,e.g., proposed in [T2] . 


which could be then attacked by means of intrinsic stochastic approximation as in ,e.g., ET ; 
or [5J. 

Another future route could be to derive an intrinsic Cramer-Rao bound on homogeneous 
spaces. It is interesting to note that both examples of m are homogeneous spaces. Moreover, 
the results on manifolds are bound to be local, but we can hope for results on Lie groups and 
homogeneous spaces with a large domain of validity (if not global). 
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